Densest Subgraph in Streaming and MapReduce

نویسندگان

  • Bahman Bahmani
  • Ravi Kumar
  • Sergei Vassilvitskii
چکیده

The problem of finding locally dense components of a graph is an important primitive in data analysis, with wide-ranging applications from community mining to spam detection and the discovery of biological network modules. In this paper we present new algorithms for finding the densest subgraph in the streaming model. For any > 0, our algorithms make O(log1+ n) passes over the input and find a subgraph whose density is guaranteed to be within a factor 2(1 + ) of the optimum. Our algorithms are also easily parallelizable and we illustrate this by realizing them in the MapReduce model. In addition we perform extensive experimental evaluation on massive real-world graphs showing the performance and scalability of our algorithms in practice.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Applications of Uniform Sampling: Densest Subgraph and Beyond

Recently [Bhattacharya et al., STOC 2015] [2] provide the first non-trivial algorithm for the densest subgraph problem in the streaming model with additions and deletions to its edges, i.e., for dynamic graph streams. They present a (0.5−ǫ)-approximation algorithm using Õ(n) space, where factors of ǫ and log(n) are suppressed in the Õ notation. However, the update time of this algorithm is larg...

متن کامل

Streaming data analytics via message passing with application to graph algorithms

The need to process streaming data, which arrives continuously at high-volume in real-time, arises in a variety of contexts including data produced by experiments, collections of environmental or network sensors, and running simulations. Streaming data can also be formulated as queries or transactions which operate on a large dynamic data store, e.g. a distributed database. We describe a lightw...

متن کامل

Efficient Primal-Dual Algorithms for MapReduce

In this paper, we obtain improved algorithms for two graphtheoretic problems in the popular MapReduce framework. The first problem we consider is the densest subgraph problem. We present a primal-dual algorithm that provides a (1 + ) approximation and takes O( logn 2 ) MapReduce iterations, each iteration having a shuffle size of O(m) and a reduce-key-complexity of O(dmax). Here m is the number...

متن کامل

An algorithmic framework for fixed-cardinality optimization in sparse graphs applied to dense subgraph problems

We investigate the computational complexity of the Densest k-Subgraph problem, where the input is an undirected graph G = (V,E) and one wants to find a subgraph on exactly k vertices with the maximum number of edges. We extend previous work on Densest k-Subgraph by studying its parameterized complexity for parameters describing the sparseness of the input graph and for parameters related to the...

متن کامل

Finding large and small dense subgraphs

We consider two optimization problems related to finding dense subgraphs, which are induced subgraphs with high average degree. The densest at-least-k-subgraph problem (DalkS) is to find an induced subgraph of highest average degree among all subgraphs with at least k vertices, and the densest at-most-k-subgraph problem (DamkS) is defined similarly. These problems are related to the well-known ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • PVLDB

دوره 5  شماره 

صفحات  -

تاریخ انتشار 2012